Credit Scoring in the Era of Big Data
نویسندگان
چکیده
For most Americans, access to credit is an essential requirement for upward mobility and financial success. A favorable credit rating is necessary to purchase a home or car, to start a new business, to seek higher education, or to pursue other important goals. For many consumers, strong credit is also necessary to gain access to employment, rental housing, and essential services such as insurance. At present, however, individuals have very little control over how they are scored and have even less ability to contest inaccurate, biased, or unfair assessments of their credit. Traditional, automated creditscoring tools raise longstanding concerns of accuracy and unfairness. The recent advent of new "big-data" credit-scoring products heightens these concerns. The credit-scoring industry has experienced a recent explosion of start-ups that take an "all data is credit data" approach, combining conventional credit information with thousands of data points mined from consumers' offline and online activities. Big-data scoring tools may now base credit decisions on where people shop, the purchases they make, their online social media networks, and various other factors that are not intuitively related to creditworthiness. While the details of many of these products remain closely guarded trade secrets, the proponents of big-data credit scoring argue that these tools can reach millions of underserved consumers by using complex algorithms to detect patterns and signals within a vast sea of information. While alternative credit scoring may ultimately benefit some consumers, it also poses significant risks. * Law Clerk to the Honorable Roy W. McLeese III, District of Columbia Court of Appeals, J.D., 2015, Georgetown University Law Center. ** S.M. Computer Science & Technology and Policy, MIT 2016. This views expressed in this article are solely those of the authors and do not represent the opinions of their respective institutions. The authors would like to thank Taesung Lee for helpful comments and suggestions on earlier drafts. The authors also wish to thank Professors David Vladeck and Alvaro Bedoya of Georgetown Law, and Professors Danny Weitzner and Hal Abelson of MIT for their extensive guidance in the development of this article. All errors remain those of the authors alone. 1 Hurley and Adebayo: CREDIT SCORING IN THE ERA OF BIG DATA Published by Yale Law School Legal Scholarship Repository, Credit Scoring & Big Data Credit-scoring tools that integrate thousands of data points, most of which are collected without consumer knowledge, create serious problems of transparency. Consumers have limited ability to identify and contest unfair credit decisions, and little chance to understand what steps they should take to improve their credit. Recent studies have also questioned the accuracy of the data used by these tools, in some cases identifying serious flaws that have a substantial bearing on lending decisions. Big-data tools may also risk creating a system of "creditworthiness by association" in which consumers' familial, religious, social, and other affiliations determine their eligibility for an affordable loan. These tools may furthermore obscure discriminatory and subjective lending policies behind a single "objective" score. Such discriminatory scoring may not be intentional; instead, sophisticated algorithms may combine facially neutral data points and treat them as proxies for immutable characteristics such as race or gender, thereby circumventing existing non-discrimination laws and systematically denying credit access to certain groups. Finally, big-data tools may allow online payday lenders to target the most vulnerable consumers and lure them into debt traps. Existing laws are insufficient to respond to the challenges posed by credit scoring in the era of big-data. While federal law prohibits certain forms of discrimination in lending and ensures that consumers have limited rights to review and correct errors in their credit reports, these laws do not go far enough to make sure that credit-scoring systems are accurate, transparent, and unbiased. Existing laws also do little to prevent the use of predatory scoring techniques that may be geared to target vulnerable consumers with usurious loans. This article, which has been developed as part of a collaborative effort between lawyers and data scientists, explores the problems posed by big-data credit-scoring tools and analyzes the gaps in existing laws. It also sets out a framework for comprehensive legislative change, proposing concrete solutions that would promote innovation while holding developers and users of credit-scoring tools to high standards of accuracy, transparency, fairness, and non-discrimination.
منابع مشابه
Credit scoring in banks and financial institutions via data mining techniques: A literature review
This paper presents a comprehensive review of the works done, during the 2000–2012, in the application of data mining techniques in Credit scoring. Yet there isn’t any literature in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct onli...
متن کاملInvestigating the missing data effect on credit scoring rule based models: The case of an Iranian bank
Credit risk management is a process in which banks estimate probability of default (PD) for each loan applicant. Data sets of previous loan applicants are built by gathering their data, and these internal data sets are usually completed using external credit bureau’s data and finally used for estimating PD in banks. There is also a continuous interest for bank to use rule based classifiers to b...
متن کاملUsing the Hybrid Model for Credit Scoring (Case Study: Credit Clients of microloans, Bank Refah-Kargeran of Zanjan, Iran)
In any country, commercial banks lay the groundwork for economic growth by collecting national resources and capitals and allocating them to different economic sectors. Optimal allocation of resources is especially important in achieving this goal. Banks with an effective and dynamic system of customer assessment can efficiently allocate their resources to customers regardless of their geograph...
متن کاملUsing DEA for Classification in Credit Scoring
Credit scoring is a kind of binary classification problem that contains important information for manager to make a decision in particularly in banking authorities. Obtained scores provide a practical credit decision for a loan officer to classify clients to reject or accept for payment loan. For this sake, in this paper a data envelopment analysis- discriminant analysis (DEA-DA) approach is us...
متن کاملImproved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring
In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...
متن کامل